Exploring Essential Attributes for Detecting MicroRNA Precursors from Background Sequences
نویسندگان
چکیده
MicroRNAs (miRNAs) have been shown to play important roles in post-transcriptional gene regulation. The hairpin structure is a key characteristic of the microRNAs precursors (pre-miRNAs). How to encode their hairpin structures is a critical step to correctly detect the pre-miRNAs from background sequences, i.e., pseudo miRNA precursors. In this paper, we have proposed to encode the hairpin structures of the pre-miRNA with a set of features, which captures both the global and local structure characteristics of the pre-miRNAs. Furthermore, we find that four essential attributes are discriminatory for classifying human pre-miRNAs and background sequences with an information theory approach. The experimental results show that the number of conserved essential attributes decreases when the phylogenetic distance between the species increases. Specifically, one A-U pair, which produces the U at the start position of most mature miRNAs, in the pre-miRNAs is found to be well conserved in different species for the purpose of biogenesis.
منابع مشابه
BP Neural Network Could Help Improve Pre-miRNA Identification in Various Species
MicroRNAs (miRNAs) are a set of short (21-24 nt) noncoding RNAs that play significant regulatory roles in cells. In the past few years, research on miRNA-related problems has become a hot field of bioinformatics because of miRNAs' essential biological function. miRNA-related bioinformatics analysis is beneficial in several aspects, including the functions of miRNAs and other genes, the regulato...
متن کاملEvidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences
MOTIVATION Most non-coding RNAs are characterized by a specific secondary and tertiary structure that determines their function. Here, we investigate the folding energy of the secondary structure of non-coding RNA sequences, such as microRNA precursors, transfer RNAs and ribosomal RNAs in several eukaryotic taxa. Statistical biases are assessed by a randomization test, in which the predicted mi...
متن کاملIdentification of MicroRNA Precursors with Support Vector Machine and String Kernel
MicroRNAs (miRNAs) are one family of short (21-23 nt) regulatory non-coding RNAs processed from long (70-110 nt) miRNA precursors (pre-miRNAs). Identifying true and false precursors plays an important role in computational identification of miRNAs. Some numerical features have been extracted from precursor sequences and their secondary structures to suit some classification methods; however, th...
متن کاملIdentification of MicroRNA Processing Determinants by Random Mutagenesis of Arabidopsis MIR172a Precursor
MicroRNAs (miRNAs) are widespread posttranscriptional regulators of gene expression. They are processed from longer primary transcripts that contain foldback structures (reviewed in). In animals, a complex formed by Drosha and DGCR8/Pasha recognizes the transition between the single-stranded RNA sequences and the stem loop to produce the first cleavage step in miRNA biogenesis. Whereas animal p...
متن کاملExploring EFL Learners’ Use of Formulaic Sequences in Pragmatically Focused Role-play Tasks
Communicative language use largely entails regular patterns consisting of pre-constructed phrases or sequences. These sequences have been examined by many researchers to find the situation-based formulas which may help L2 learners follow a possibly more target-like speaking system. This study, therefore, explored two categories of formulaic expressions including speech formulas and situation-bo...
متن کامل